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ABSTRACT —Taxonomic analysis of macromycete fruiting bodies is a challenging task 
that utilizes morphological, biochemical, and molecular methods. Many biochemical and 
molecular methods have been developed to test or confirm identifications or phylogenetic 
positions independently of morphological data. SDS electrophoresis has been shown to be a 
good biochemical method for protein separation. Although protein profiles can be analyzed 
by commercially available software, there is no software designed specifically for fungal 
taxonomic research. We have developed an open source portable program that uses protein 
profiles of fungal fruiting bodies to calculate relative differences between species for use in 
generating to generate more accurate phylogenetic trees. 
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Introduction 

The challenging task of macromycete taxonomy uses morphological, 
biochemical, ecological, and molecular data (Guarro et al. 1999, Blackwell et 
al. 2006, Korabecna 2007, Lutzoni & Vilgalys 1995). Sodium dodecyl sulfate 
(SDS) electrophoresis is a good biochemical method for protein separation 
and biomarker discovery, which when applied to the proteomes of fungal 
fruiting bodies gives a good number of separated proteins, revealing small (or 
sometimes large) differences in protein expression (Materi¢ 2012). The method, 
which is robust and reliable, could serve as an independent taxonomic tool for 
fungi (Guarro et al. 1999, Tyrrell 1969). 
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The aim of this work was to develop a user-friendly open source program 
using electrophoresis data for fungal taxonomy. Such a program would compare 
and calculate relative differences between taxa based on results from SDS gel 
electrophoresis and similar techniques (e.g., western blot, zymography). 


Materials & methods 

We sampled and identified 21 fruiting bodies representing differently related 
species. Proteins were separated according to the method described in Materic (2012), 
and protein profiles were obtained for each species. The quality of protein profiles 
depends upon many factors during sample homogenization, protein extraction and 
electrophoresis, and an optimized protocol can be found in Materic¢ et al. (2012). In this 
work we have chosen to show the results of only seven species: however, more detailed 
dendrograms are available in Materic (2012). 

Protein profile data are stored in .csv format and examples of files can be found along 
with the source-code in Materi¢ (2013). Protein profile files include two sets of data: 
(1) molecular weights (MW) of proteins; and (2) relative abundance. MW and relative 
abundances can be obtained using specialized software such as TotalLab (Phoretix, 
Newcastle, United Kingdom). However, abundances should be recalculated as relative 
abundances with range of 1 (least abundant) to 5 (most abundant). In order to create 
a distance matrix table, protein profiles are compared by distancelD. The distance 
matrix tables are transformed into dendrograms using the program package PHYLIP 
(Felsenstein 2002). 


Results & discussion 

Protein electrophoresis, which has been widely used in solving taxonomic and 
systematic problems, produces results that are independent of morphological 
and molecular data (Guarro et al. 1999, Tyrrell 1969). Thus, it is crucial to 
have appropriate software that deals with taxonomic issues rather than with 
pure biochemical properties. Apart from distance1D, there is no such software 
specifically designed to interpret fungal taxonomy from protein profile data. 

Distance1D is a program designed to calculate relative differences between 
two protein profiles, which could then be used to generate dendrograms 
(Fic. 1). The program is designed and written specifically for use in fungal 
taxonomy (mainly for analyzing protein profiles of fungal fruiting bodies), but 
its usage could be wider. The program is meant to be open source, user-friendly, 
and portable. Portability results from the program's being written in Perl, which 
can be run on any operating system. As the program uses the Tk library for 
generating widgets (Windows-like interface), Perl/Tk should be installed along 
with Perl. 

Our previous work produced good protein profiles of fungal fruiting bodies 
from SDS gel electrophoresis (Materi¢ 2012) that are stored in separate files 
(Fic. 1). 
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FIGURE 1. Diagram showing stages of an experiment used in order to gain biochemically based 
phylogenetic trees. A: Fruiting bodies. B: Result of SDS electrophoresis (channels: M = molecular 
mass marker; 1 = Boletus luridus; 2 = B. edulis; 3 = Suillus granulatus; 4 = Russula foetens; 
5 = Lactarius piperatus; 6 = R. cyanoxantha; 7 = R. emetica; Materić et al 2012). C: Interface of 
the program distance1D. D: Distance matrix table generated by the program. E: Phylogenetic tree 
drawn by PHYLIP program package (similar results were obtained by analyzing ITS II regions of 
rDNA as control; Materié 2012). 


The program as an input takes the paths of two files (two file names) where 
protein data are stored in .csv format. An algorithm searches for optimal pairing 
of proteins by looping through the parameters, such as MW sensitivity. After 
alignment and optimization, the following differences are scored: (1) presence 
of new protein lines (new protein), (2) molecular weight difference of protein 
pairs, and (3) abundance difference of protein pairs. The scoring system, 
which could be adapted according to user requirements, is set up as follows: 
“10” scored for each new protein line, “1” scored for each 1kDa MW shift, and 
“2” scored for abundance difference. If DNA/RNA gel data are going to be used, 
the abundance difference for each fragment should be set the same. 

There are several important advantages to using this program: the program 
generates distances that give accurate phylogenetic trees (more accurate than 
a single DNA sequence such as rDNA ITS1; Materi¢ 2012); the parameters 
can be changed to suit particular research; the program can be easily adapted 
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for use with any protein/DNA/RNA electrophoresis data; and the program is 
suitable for all operating systems and is open source and free. 
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